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ABSTRACT 

With the increased accuracy and angular scale coverage of the recent CMB experiments it 
has become important to include calibration and beam uncertainties when estimating cosmo¬ 
logical parameters. This requires an integration over possible values of the calibration and 
beam size, which can be performed numerically but greatly increases computation times. We 
present a fast and general method for marginalization over calibration-type errors by analyti¬ 
cal integration. This is worked through for the specific example of CMB calibration and beam 
uncertainties and the resulting formulae are practical to implement. We show how cosmo¬ 
logical parameter constraints from the latest CMB data are changed when calibration/beam 
uncertainties are taken into account: typically the best fit parameters are shifted and the errors 
bars are increased by up to fifty per cent for e.g. n s and fit,/! 2 ; although as expected there is 
no change for £2 k, because it is constrained by the positions of the peaks. 

Key words: cosmology:observations - cosmology:theory - cosmic microwave background 
- methods statistical 


1 INTRODUCTION 

Fluctuations in the cosmic microwave background (CMB) radiation 

on scales of fractions of a degree and larger are potentially a direct 
probe of the state of the universe 300,000 years after the big bang, 
modified by the geometry of the universe. If the initial fluctuations 

were Gaussian and structure formed by gravitational collapse then 
the angular power spectrum of the CMB contains much cosmolog¬ 
ical information, and is also easy to calculate using codes CMB- 
FAST (Seljak & Zaldarriaga 1999) and CAMB (Lewis, Challinor & 
Lasenby 1999). Therefore many experiments have been carried out 
to estimate its form, and the results from the second generation of 

CMB telescopes are eclipsing previous results. Recent work involv¬ 
ing parameter estimation from the CMB includes Wang, Tegmark 

& Zaldarriaga (2001), Netterfield el (2001), de Bernardis et al. 
(2001), Pryke et al. (2001), Stompor et al. (2001), Jaffe et al. 
(2001), Bridle et al. (2001), Kinney. Melchiorri & Riotto (2001), Le 
Dour et al. (2000). Lahav et al. (2000), Dodelson & Knox (2000), 
Melchiorri et al. (2000), Efstathiou (2000), Gawiser & Silk (1998) 
and Lineweaver (1998). 

CMB power spectrum results have significant calibration un¬ 
certainties, due either to uncertainty in the flux of the calibration 
source (e.g. Jupiter) or difficulties in its measurement by the exper¬ 
iment in question (e.g. a scan synchronous noise in measurements 
of the CMB dipole) or both. As a result the band power AT or AT 2 
estimates from any single experiment can be scaled up or down by 


some unknown factor. This calibration uncertainty is now of greater 
significance because of the increased precision of experiments: it is 
now of a similar size to the quoted random errors. In addition, be¬ 
cause of the correlations in errors that it introduces, the calibration 
uncertainty is not simple to take into consideration when extract¬ 
ing cosmological parameters from a CMB power spectrum. This 
contrasts with the case of a calibration uncertainty on a single data 
point, which can be approximately taken into account by adding 
the calibration uncertainty in quadrature with the random errors. 

A fast method in the literature for dealing with this uncertainty 
couples the marginalization over the calibration with that over the 
CMB power spectrum normalization (Ganga et al. 1997, Lange et 
al. 2001). However, this is non-trivial to extend to the case where 
several CMB data sets have independent and significant calibration 
uncertainties (~ 20 per cent in d T 2 for BOOMERANG, Netter¬ 
field et al. 2001, and ~ 8 per cent for MAXIMA-1, Lee et al. 2001, 
and DASI, Halverson et al 2001). Wang et al. (2001) account for 
the calibration uncertainty by using a method related to that pre¬ 
sented here, however the derivation is not well documented in the 
literature. Frequently the marginalisation is carried out numerically, 
which is time consuming. Here we present the full derivation of a 
fast method, in which the calibration correction for a single data set 
is marginalised over analytically. This takes no more computation 
time than when the calibration uncertainty is ignored. 

In addition to a calibration uncertainty, the experiments such 
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as BOOMERANG and MAXIMA suffer from beam uncertainties. 
Many systematics can produce an uncertainty in the reconstruc¬ 
tion of the instrumental beam. For example, in the case of the 
BOOMERANG experiment the main contribution to the beam un¬ 
certainty is due to the pointing uncertainty. Pointing uncertainty 
leads to beam type uncertainties because temperature measure¬ 
ments are not made continually, but instead are averaged over some 
discrete time period. High frequency jitter in the pointing means 
that each discrete measurement includes pointings over a region of 
the sky. The effective beam of an experiment reflects the area of 
the sky being sampled in a given time step, so the true beam must 
be convolved with the area sampled in a given time step. Thus, 
jitter translates into a larger effective beam and uncertainty in the 
amount of jitter can be treated as uncertainty in the effective beam 
size. The impact on the CMB power spectrum is to introduce an 
‘angular-scale-dependent’ error. For conciseness in the rest of this 
paper, we refer to the combined effects of pointing and beam un¬ 
certainties simply as beam uncertainty. 

To include both calibration and beam uncertainties is com¬ 
putationally costly. In general the beam uncertainty is integrated 
over numerically, thus the computation time is increased by a factor 
equal to the number of integration steps used. In this paper we show 
how an ‘angular-scale-dependent’ uncertainty such as the beam er¬ 
ror can be marginalised over analytically assuming a Gaussian prior 
on the size of the correction. The combined analytic calibration and 
beam marginalisation takes two to three times as long to calculate 
as when there are no such uncertainties, and has already been used 
in Lahav et al. (2001) and Bean et al. (2001). 

This method follows the general approach of marginalizing 
over nuisance parameters analytically discussed, for example, in 
Gull (1989), Sivia (1996) and Lahav et al. (2000). As discussed 
in this introduction, this work is motivated by the recent CMB 
data sets. However, the fast marginalisation over a calibration type 
parameter should be much more widely applicable in astronomy. 
Therefore in Section 2 we show the analytic marginalised result 
for the general case of a correlated beam uncertainty, reserving 
the mathematical derivation for the Appendix. Section 3 gives fast 
computational versions of the formulae for the special cases of 
CMB calibration alone and CMB calibration and beam uncertain¬ 
ties. In Section 4 we illustrate the effect of the marginalisation by 
applying them to the latest CMB data. 


2 ANALYTIC MARGINALIZATION 
2.1 General approach 


depends on how the nuisance parameter affects the observations. 
As described below, we can model these corrections by a factor 
linear in the parameter times some ‘template’ x b , so the predictions 
for the full observations are 


X P' = x p +bx b . 


(1) 


Here, the vector x b is assumed to be an arbitrary function of the 
predictions, but not to depend on the data or its noise correlation 
matrix. The parameter b can be thought of as a measure of how far 
the nuisance parameter deviates from its expected mean value. 

As an example consider the calibration uncertainty on the 
CMB temperature angular power spectrum. In this case the x° and 
x p are the observed and predicted AT 2 bandpower measurements. 
The predictions could be scaled up and down by a factor around 1 
therefore x* = x p . 

If the observations are subject to some noise drawn from a 
multivariate Gaussian distribution with correlation matrix , then 
the probability of the data given a model (the model’s likelihood) 
is given by 


Pr(x°|7V,x p ,A,(>) = AV exp 


-^(x°-x p ') 1 5Vr 1 (x°-x p ') 


( 2 ) 


where = (2jr)^' ! / 2 |rA£ | -1 / 2 and n is the number of data points. 
(See the start of Section 3 for a discussion about the validity of 
the Gaussian assumption for CMB band power measurements.) To 
obtain the likelihood of the data independent of the calibration type 
errors, we must marginalise over b 

Pr(x°|!A£,x p y,a b ) = |Pr(x°, b\cM , x p , x b , a b )db (3) 

= f Pr(x°|A:,x p ,x i , A)Pr(A|a b )db, (4) 

using Bayes theorem and assuming P(b\d\i , x p , x b , o b ) = P(b |o b ). 

It is not always clear what form the prior on the calibration 
parameter b should take. However, if this prior has a simple form, 
then this marginalisation can often be performed analytically. Here, 
we assume the prior is a Gaussian of width O/,, 

p(&iob)= (i^)W exp [ _fc2/2 ^]- (5) 

If we were to instead assume a top hat prior on b then the analytic 
marginalisation can still be carried out, but leads to error functions, 
and is thus more complicated to implement. 

Substituting Eq. 2 and Eq. 5 into Eq. 4, gathering up terms 
in powers of b, completing the square and integrating over b (see 
Appendix), one finds 


Before addressing the specific problems of marginalizing over cal¬ 
ibration and beam uncertainties, let us examine the more general 
problem of ‘nuisance’ parameters. Quite often, observational re¬ 
sults will depend on parameters of the measurement which are not 
precisely known and whose value is not of great interest to us. In 
such cases, it is useful to determine a likelihood function which 
folds in the effects of the uncertainties in these nuisance parame¬ 
ters by marginalizing over their possible values. The resulting like¬ 
lihood no longer directly depends on the nuisance parameter, but 
its uncertainties become incorporated in the modified data and its 
correlation matrix. 

In many problems, we can model the observational data, x°, 
by the predictions from an underlying theory, x p , with a correction 
arising from some uninteresting (for the present purposes) nuisance 
parameter b. Thus, the predictions are modified in some way which 


Pr(x°|?\(, x p , x b , a b ) = N M exp 


^(x°-x p ) T 5W'- 1 (x°-x p )i6) 


^ -—-. 

x b 5\£ “ 1 x* + o b _ 


(7) 


It is straightforward to show that 


CM = +Ct b x fc x iT , 


(8) 


which is the Sherman-Morrison result (see e.g. Press et al. 1992). 
Note that Eq. 6, 7 is an exact result, and does not, for example, rely 
on a Taylor expansion. This calculation looks almost the same as 
when the nuisance parameters are not taken into account (x p/ = x p 
Eq. 2), except that the matrix has been replaced by the matrix 
CM , which could be a function of the predicted data x p . Note that. 
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even in the case where correlated errors between data points are as¬ 
sumed to be negligible, i.e. “ 1 is diagonal, the new matrix At ~ 1 
is not diagonal. The marginalization effectively introduces a corre¬ 
lated error between the data points, adding error to the collective 
modes which are affected by the nuisance parameter. 

This technique has a broad array of applications in the vari¬ 
ous stages of CMB analysis, from map making to foreground re¬ 
moval (see e.g. Bond & Crittenden, 2001). For example, it is used 
to remove the effects of foreground contaminants from maps such 
as the galaxy. In previous applications, however, the templates re¬ 
moved were independent of the theory, so that the correction to the 
noise matrix must be done only once. However, in problems such as 
calibration marginalization, the templates depend on the theoretical 
predictions, so the effective weight matrix must be re-evaluated for 
each theory. As we show below, however, this does not significantly 
slow down the evaluation of the likelihoods. 

Nothing in the above is specific to the CMB, so in fact it could 
also be used a variety of applications, such as the uncertain nor¬ 
malisation of the matter power spectrum resulting from Lyman—a 
forest measurements (e.g. Croft et al. 2001). 

2.2 Marginalization over multiple parameters 

Often there are several nuisance parameters which must be 
marginalised over simultaneously. The above process can be re¬ 
peated several times, building up more complicated matrices At . 
As an example, the CMB bandpower measurements are often sub¬ 
ject to both calibration and beam uncertainties which must be 
marginalised over. (The result for CMB calibration and beam un¬ 
certainties is given in the following Section.) This may also be 
occur where there are several experiments each with internal and 
correlated external calibration uncertainties, as discussed in Knox 
& Page (2000), and Tegmark & Zaldarriaga (2001). One can ex¬ 
pand this set even further, including some simple cosmological pa¬ 
rameters. In particular, the overall theory normalization can be per¬ 
formed in the same way. 

When applying this technique to the marginalization over 
many non-orthogonal parameters simultaneously, it can be useful 
to use the block form of the above result. The Woodbury formula 
(also Press et al. 1992) can be used 

(*t+xx T y l = +x T ^~ l xr 1 x T ^ l }A9) 

where X is a m x n matrix containing m templates of n elements 
each. Note that this requires only the inversion of a m x m matrix, 
where m is the number of parameters being marginalised over. This 
formula can be much easier to implement than repeated application 
of the Sherman-Morrison result once the number of parameters be¬ 
ing integrated over exceeds two. 

In the case where more than one variable is marginalised over, 
the corresponding normalization factor generalizes to. 

N m = #* [det(l+X 7 '5\r 1 X)]“ 1/2 . ( 10 ) 

Again, this only requires the evaluation of the determinant of an 
m x m matrix, which is computationally inexpensive especially 
given that this matrix must already be inverted. While sometimes 
this normalization factor can be ignored as an overall prefactor, it 
cannot be dropped when the the templates are theory dependent. 

These techniques have been used in the analysis of the COBE 
data, where there was the possibility that the data were con¬ 
taminated by a quadrupole of unknown amplitude. This was ac¬ 
counted for by marginalisation over the possible amplitude of the 


quadrupoles, by modifying the noise correlation matrix using the 
Woodbury formula (Bond, Jaffe & Knox 1998). 


3 PRACTICAL IMPLEMENTATION 

In this Section, the above equations are rewritten in a more com¬ 
putationally practical way for the specific case of CMB calibration 
and beam uncertainties in power spectrum measurements. 

The analytic technique described above assumes that the ob¬ 
served data are Gaussianly distributed. This is clearly only an ap¬ 
proximation for band power measurements, as the power spectrum 
must be positive. When there are only a few modes in a band power 
measurement, a better approximation is to assume a log-normal 
probability distribution (Bond, Jaffe & Knox, 2000). However, if 
there are sufficiently many modes and the error bars are signif¬ 
icantly smaller than the measured band powers, then the Gaus¬ 
sian approximation should be reasonable and the techniques we 
describe can be applied. 

3.1 Calibration Uncertainty 

First we consider the case of a calibration uncertainty in the CMB 
temperature anisotropy angular power spectrum. The data x° are 
the CMB AT 2 bandpower measurements and the x p are the pre¬ 
dicted bandpowers. In the case of a calibration uncertainty, the pre¬ 
dictions are those expected from the underlying model, multiplied 
by some unknown factor, c which is the potentially incorrect ex¬ 
perimental calibration, thus the predictions become x^ = cx p . For 
example, BOOMERANG has a calibration uncertainty of 20 per 
cent in AT 2 and therefore c ranges from roughly 0.8 to 1.2. 

The calibration marginalization is identical to the general case 
discussed above if we take b = c — 1 and x b = x p . For clarity in 
the following subsections we will write the calibration uncertainty 
as a c , but it can be identified with Ob, since b and c have identical 
distributions (though they are about a different mean.) Therefore 

iM = At +a^x p x pT . (11) 

Note that since the the power spectrum measurements, and there¬ 
fore c, are positive, we must have o c <C 1 for the Gaussian prior for 
the calibration to be a reasonable approximation. 

Note that this is identical to the formulation used in Wang et 
al. (2001) except that they use the approximation that the theory x p 
is nearly equal to the data x° to make a theory independent correc¬ 
tion to the noise matrix. This is performed just once rather than for 
each theory, making it as fast as if the calibration uncertainty was 
ignored. Formally, this is almost equivalent to applying the cali¬ 
bration correction to the data, but not their error bars. In practical 
terms we find that using x° instead of x p makes a shift of a few per 
cent in estimated parameter values. (The same remarks apply to the 
beam marginalisation in the following subsections.) 

If we instead attempt to use the exact expression of Eq. 11 
then, because the matrix At (or more importantly, its inverse) is 
a function of the predicted quantities, x p , it must be re-calculated 
for each underlying model and this can be very time consuming. 
(Not necessarily as time consuming as evaluating the predictions 
for a given theory, but here we assume these are already known.) 
For the simple calibration uncertainty case, a fast computational 
implementation is to calculate in advance the quantities 

v 0 = AT'x 0 (12) 

ioo = x oT a:" 1 x 0 . (13) 
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Then, dropping terms which are independent of the model predic¬ 


tions, the effective chi-squared defined by 

Pr(x°|5\£,x p ) °c exp[-x?/2] (14) 

is given by 

Xc = ioo-2Spo+ipp-(5po-ipp) 2 Ac+log(i c O") (15) 

■Spo = x pT vi (16) 

ipp = x pT !A£ _1 x p (17) 

$c = ^pp H 2 ■ (18) 


If the covariance matrix is not diagonal, the matrix product, 
Eq. 17, scales as n 2 and this dominates the time required to cal¬ 
culate x~- If the covariance matrix is diagonal, the matrix product 
and the vector products are both linear in n and take comparable 
amounts of time. In either case however, the effects of marginaliz¬ 
ing over the calibration (the last two terms in Eq. 15) use the same 
factors required to calculate the naive x 2 , so this process takes no 
more computation time. 

3.2 Beam Uncertainty 

Measurements of the cosmic microwave background radiation an¬ 
gular power spectrum are often hampered by an uncertainty in the 
telescope beam. This leads to correlated errors in the estimates of 
power spectrum analogous to those of the calibration uncertainty, 
with the added complication that the corrections grow as the an¬ 
gular scale becomes smaller. Marginalizing over these can be per¬ 
formed in the same way, but now x h = Ax p , where A is a diago¬ 
nal matrix which increases for higher multipole band powers. The 
precise nature of this matrix depends on the nature of the beam 
shape and ‘jitter" uncertainties, and is best estimated by the exper¬ 
imental teams. For example, the BOOMERANG and MAXIMA 
teams give 1 — a beam uncertainties, dx° (for BOOMERANG see 
Fig. 2 of Netterfield et al. 2001; for MAXIMA-1 we add in quadra¬ 
ture the beam and pointing contributions given in Table 1 of Lee et 
al. 2001). The matrix A is given by the percentage errors for each 
bandpower, that is, 

A = diag(dx? /x?) (19) 

where x° are the derived bandpowers and the normalization is such 
that CTj, = I. 

The nature of the beam uncertainties, at least on scales larger 
than the beam, can be understood qualitatively by considering the 
case of a Gaussian beam whose angular size is not well known. 
For a beam of true size 0 and which has been misestimated to be 
0(), the predicted bandpower x? at mean multipole i, must be is 


transformed by the relation 


xf = jc? exp |V 2 (0 q — 0 2 )j 

(20) 

* Af (l+^(05-O 2 )) 

(21) 


(assuming the estimate is not too far from the truth.) Thus if the 
beam is assumed to be too small, the inferred band power will be 
smaller than the true band power. The error in the measurement 
will increase for band powers at higher t. If the beam uncertainty 
is small compared to its mean size (a@ << 0o) and is Gaussianly 
distributed, then (0 2 — 0g) will also be Gaussian distributed, with a 
width of c e : = 2oe0(). The marginalization then can be performed 
exactly as derived in Section 2, with G/, = Oqz and A = diag(T?), 


where (; is the mean multipole of the z' th band. In practice, this is 
very close the scaling of beam uncertainties given by the experi¬ 
ments such as BOOMERANG and MAXIMA. 

Fast marginalization over the beam uncertainty alone may be 
carried out in a similar way to that over the calibration uncertainty. 
However, since the recent experiments with a beam uncertainty also 
carry a calibration uncertainty, we proceed straight to marginaliza¬ 
tion over both the calibration and beam uncertainties simultane¬ 
ously. 

3.3 Calibration and Beam Uncertainty 


The equations for fast computation use the quantities already de¬ 
fined in Eq.s 13, 15 to 18 and 19. The calculation can be speeded 
up by advance calculation of 


v fl 

= 

a t nL 'x 0 

(22) 


Qa 

= 

-t'V ” 1 

(23) 


Qaa 

= 

a t c\C~ 1 a. 

(24) 


The effective chi-squared defined by 



Pr(x° 

H, 

x p ,o b , A,a c ) °C exp -Xch / 2 

(25) 


is given by 



lib 

= 

(.Sao Sap (s p o ^pp)^ap/ Sc)^ 

l+Saa-slp/Sc 

-+log(l + Sao- 


Sao 

= 

X pT V fl 


(27) 

Sap 

= 

x pT QflX p 


(28) 

Saa 

= 

x pT QflflX p . 


(29) 


This takes two to three times as long to calculate as when there are 
no calibration or beam uncertainties, depending on whether is 
diagonal (and assuming n is large). 


4 APPLICATION TO DATA 

It has been made clear by the experimental teams that calibration 
and (where applicable) beam uncertainties must be included in any 
analysis of the data presented. However it is very tempting to save 
considerable computer time by ignoring these uncertainties. In this 
section we apply the formulae presented above to CMB data, in¬ 
vestigating how big a difference the inclusion of calibration and 
beam uncertainties make to parameter estimation. We consider the 
latest BOOMERANG and COBE data alone, for simplicity and be¬ 
cause this data set has the largest calibration and beam uncertain¬ 
ties. The power spectrum from this experiment was estimated in 
19 bins spanning the range 75 ^ ^ 1050. Since no information 

on the BOOMERANG window functions and full covariance ma¬ 
trix is publicly available yet, we assign a top-hat window function 
for the spectrum in each bin and neglect correlations between bins. 
This approach is a good approximation of the correct one (see de 
Bernardis et al. 2001) and does not affect our conclusions. For our 
marginalisations over the beam uncertainties, we took the 1 — a er¬ 
ror bars from Fig. 2 of Netterfield et al. (2001). We also include 
the COBE data using the RADPack packages (Dodelson & Knox 
2000 ). 

The theoretical models are computed using the publicly avail¬ 
able CMBFAST code (Seljak & Zaldarriaga). The ranges of our 
database of models, or equivalently top hat priors, are 0.1 < Q.,„ < 
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1.0, 0.015 < £l b < 0.2, 0.0 < Q. a < 1.0 and 0.25 < h < 0.95. We 
define h = Hq/( 100 kms^’Mpc -1 ) throughout. We vary the spec¬ 
tral index of the primordial density perturbations within the range 
0.5 < n s <1.5 and we re-scale the amplitude of fluctuations by a 
pre-factor C\q. We also assume an external Gaussian prior on the 
Hubble parameter h = 0.70 ±0.1 and limit the analysis to models 
with age to > 10 Gyrs (see, e.g. Ferreras, Melchiorri, Silk 2001). 
It is important to note that the constraints we will derive on the 
various parameters are heavily affected by the size of our database 
and by the priors assumed. Considering a background of gravita¬ 
tional waves or a different optical depth of the universe, for exam¬ 
ple, would change our constraints. Here we illustrate the effect of 
the CMB systematics on just the simplest models. 

We find that neglecting the calibration error the scalar spec¬ 
tral index n s = 0.91 ±0.04, while when including calibration n s = 
0.89 ±0.06. Even though these numbers are compatible, it is im¬ 
portant to notice that a scale invariant n s = 1 power spectrum is 
excluded at 2a in the first case, while is still inside two standard 
deviations when the calibration error is included. 

We consider the constraints on the physical baryon density pa¬ 
rameter £l b lr. The baryon density plays a crucial role in the deter¬ 
mination of the relative amplitude of the peaks in the power spec¬ 
trum and could therefore be significantly affected by beam uncer¬ 
tainty. This is particularly true if only the first two Doppler peaks 
are well constrained by the data. (The determination of a third peak 
would likely break this degeneracy.) Neglecting beam uncertainty 
one obtains the tight constraint Q. b lr = 0.022 ±0.004, excluding a 
low f l b lr ~ 0.010 region which is still compatible with some Big 
Bang Nucleosynthesis data at more than 2o, while including the 
beam error one infers Q. b lr = 0.020 ± 0.006. 

We find that the constraint on the universe curvature is insen¬ 
sitive to whether beam and calibration uncertainties are taken into 
account, which is perhaps to be expected since the curvature af¬ 
fects the acoustic peak positions, rather than their amplitudes. We 
also found that the constraints on the cold dark matter physical den¬ 
sity remain at Q. c h~ = 0.13±0.05 whether or not calibration and/or 
beam uncertainties are taken into account. 

The fast method presented in this Paper is crucially important 
when combining multiple experiments, each of which may have in¬ 
dependent calibration and beam uncertainties. These introduce new 
unknown parameters and current popular methods are severely af¬ 
fected by an increase in the number of free parameters. However 
note that Markov Chain Monte Carlo methods (see Knox, Chris¬ 
tensen & Skordis, 2001) are much less affected by increases in the 
number of free parameters. 

Using the formulae presented here, the jy cb values for each ex¬ 
periment are simply added together. Thus the computational time 
is still only a few times longer than that when calibration and beam 
uncertainties are ignored completely. To illustrate this we com¬ 
bined the latest BOOMERANG, MAXIMA, DASI and COBE data 
to obtain cosmological parameter constraints. We find that the re¬ 
sults on the scalar spectral index are n s = 0.91 ± 0.02 if we do 
no take in to account calibration and beam uncertainties, while 
we obtain a similar value but larger error bars n s = 0.91 ±0.04 
when the above systematics are considered. However, we found 
a b h 2 = o.oi9™ independent of whether calibration and beam 
uncertainties are taken into account. This is mainly due to the ab¬ 
sence of beam uncertainty for DASI. 

For the BOOMERANG plus COBE analysis we also compare 
with a numerical marginalization, in which the calibration error is 
‘maximised' over and the beam error is marginalised over in seven 
integration steps. We find very consistent results, thus the analyses 


carried out by the experimental teams using this type of numeri¬ 
cal approach can be relied on. However note that when combining 
more data sets each with independent calibration (and beam) un¬ 
certainties this numerical method is no longer practical. 


5 CONCLUSION 

Our result for the analytic marginalization over calibration uncer¬ 
tainty is simple, easy to implement and fast. In general, a numerical 
marginalization (integration) over calibration and beam increases 
the computation time by a factor equal to the number of integration 
steps squared. Inclusion of calibration uncertainties by analytical 
methods does not increase computation times, and adding in beam 
uncertainties leads to a further increase by a factor of only two or 
three. This is true irrespective of the number of different data sets, 
each with their own independent calibration and beam uncertain¬ 
ties. 

We have shown that marginalization over the calibration and 
beam uncertainties can make a significant difference to parameter 
estimation, particularly in widening the error bars on some param¬ 
eters as much as fifty per cent. We verify that the constraint on the 
universe curvature is unaffected by the inclusion of the calibration 
and/or beam uncertainty, but that the physical density of baryons 
or the spectral index of primordial fluctuations n s can be signifi¬ 
cantly affected, allowing significantly lower values for Q. b lr and 
widening the error bars for n s . 

In summary, we show that calibration and beam uncertainties 
should be taken into account and we present a method which allows 
this to be done exactly and with little extra work. 
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APPENDIX A 

We substitute Eq. 2 and 5 into Eq. 4 and collect up terms in b, 
assuming _1 is symmetric. We then complete the square and use 
the standard result for the integral over a Gaussian. 


P(x°|iA£,x p , j?,o b ) = 


N, 


K 


\/2na b 


N, 


A 


\/2na b ■ 


j dbexp —^(x° — (x p + bx b )) T 5V© 1 ( x ° — (x p +bx b )) 

J dbexp -i((x°-x p ) T ; Ar 1 (x 0 -x p )-2(x 0 -x p ) T 9V:- 1 x b b+(: 


2°6 


x bT iA£ 'xh + cv 


N, 


A 


exp 


V^icTb 

x J dbexp 


-^(x°-x p ) T 3V: _1 (x 0 -x p ) 


T _2 (x°-X p ) T 5V;" 1 X b \ t C(v°-'rf , V I V 1 v b 'l 

x b l x bq- Ch “b+ ' 


yx^ir 


'xb+Ou 2 , 


= N m exp 


2 \ x b yl~ l x b + a h 2 


((x°-x p ) T 5v;” 1 x b ) 

x bT 5v;- 1 x b +or 2 


where, 

N m = (27t)-"/ 2 |5W|- 1 / 2 = (2jl)-"/ 2 |5V:+ogx*x feT |- 1 / 2 

= N K (l+^slC-Vog)" 172 . (31) 

This result is equivalent to Eq. 6. 



(30) 
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APPENDIX B 


Above, we assumed a particular form of =x p + (b — b)x b , where 
for simplicity we defined b such that b = 0. Here we will try to jus¬ 
tify this choice, show why it often arises and discuss some simple 
forms that x b may take. 

In general, the data could be an arbitrary function of the the¬ 
oretical predictions and the nuisance parameter, i.e. x p/ = F/,(x p ). 
However, often the raw measurements, x° and , are not reported 
directly. Instead, experimentalists give results which may be com¬ 
pared to the theoretical predictions by assuming the best estimate 
of the nuisance parameter, b = b. They solve then for the inferred 
observations given that value to find x 0 ' = Fg -1 (x°), and solve for 
its noise correlation matrix in the same way. The likelihoods de¬ 
rived using this new variable are equivalent to those derived using 
the original variables. While the variables are rescaled, so are their 
inferred noise levels, leaving the likelihoods unchanged. 

The theoretical predictions for these new inferred observables 
are 


Fr'F^xP) = F S -*F - h (xP) + (b-b) 


dFr- l F h (xn 


= x p 


+ (b — b) 


- aF r >F i( x p ) 


3 b 


3 b 


b=b 


+ ... 


b=b 


(32) 


where the first relation follows front a Taylor expansion. We can 
now see that the form of x p/ used above arises naturally when we 
are considering the x^ variables and we can associate x b with a 
particular function of F/,. If the higher order terms become impor¬ 
tant for reasonable values of b, then the analytic methods described 
here will not be applicable. This will depend on how well the nui¬ 
sance parameter is known. In addition, it often happens that F* is 
linear in b, where these higher order terms are identically zero. 

Let us consider some simple examples of transformations F/, 
and the form of the x b vectors assoiated with them. If we take a 
simple displacement, F;,(x p ) = x p + bx b , where x f is a fixed tem¬ 
plate independent of the predictions, then it is easy to show that 
Fj _1 Fi,(x p ) = x p + (b — i)x f , so x b = x f . We can also consider a 
simple multiplicative factor F^(x p ) = Z;x p , as might arise from sim¬ 
ple calibration uncertainties. Then Fj _1 Ffc(x p ) = x p + (b — b)xP/b. 
We can absorb the l/b into the definition of b , so that x b = x p . 

Finally, consider the more complicated case where F/,(x p ) = 
A(b)xP, where A(b) is an n x n matrix. Such is the case for 
beam uncertainties, where what is measured are the band pow¬ 
ers of the map smoothed by the beam and one is attempting to 
determine the band powers of the unsmoothed map. In this case, 
Fg _1 Ffc(x p ) = x p + (b — b)A [b)~ x A 1 (b)xV. Thus, x b is some n x n 
matrix times x p . In the beam example, if the beam is Gaussian, 
A ( b) = diag(e~ fe 7), so x b = —diag((!?)x p . In this example, we have 
ignored higher order terms in (b — b), which may be important for 
band powers at high t, where the uncertainties caused by the poorly 
determined beam are of order the size of the measurements. 
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